ASYMPTOTIC DISTRIBUTION OF THE MAXIMUM 
INTERPOINT DISTANCE IN A SAMPLE OF RANDOM 
VECTORS WITH A SPHERICALLY SYMMETRIC 
DISTRIBUTION 
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Abstract. Extreme Value theory is part and parcel of any study of 
order statistics in one-dimension. Our aim here is to consider such large 
sample theory for the maximum distance to the origin, and the related 
maximum "interpoint distance," in multi-dimensions. We show that for 
the spherically symmetric families of densities, these statistics have a 
Gumbel type limit, generalizing several existing results. We also discuss 
the other two types of limit laws and suggest some still open problems. 
This work complements our earlier study on the minimum interpoint 
distance. 



1. Introduction and Main Results 

Let Xi,X2, . . . be an independently and identically distributed (i.i.d.) 
sequence of random vectors in with a spherically symmetric distribution, 
where d ^ 2. (See Subsection 15.31 for some comments on the case d = 1; 
otherwise d ^ 2 will always be assumed.) We are interested in the maximum 
interpoint distance 

m(2) := max \Xi - XA, (1.1) 

where | • | is the usual Euclidean distance. This has previously been studied by 
several authors in various special cases, including Matthews and Rukhin |ll[ | 
(symmetric normal distribution), Henze and Klein [sj (Kotz distribution), 
Appel, Najim and Russo [li] (uniform distribution in a e.g. ball), Appel 
and Russo 0] (uniform distribution on a sphere), Mayer and Molchanov 
[l^ (e.g. uniform distribution in a ball or on a sphere). We provide here 
some general results for the case of unbounded random vectors from any 
spherically symmetric distribution, which includes the work in [ll| and 

The results for maximum distance can be considered as complementary to 
the results for the minimum interpoint distance, see e.g. Jammalamadaka 
and Janson [Q]. One important difference is that the minimum distance 
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is (typically) achieved by points in the bulk of the distribution, while the 
maximum distance is obtained by outliers. This makes the maximum dis- 
tance less useful for goodness of fit tests, but might be suitable for detecting 



outliers. Some applications are given in Matthews and Rukhin 

The maximum paii 
distance to the origin 



(2) 

The maximum pairwise distance Mn is clearly related to the maximum 



Mn := max \Xi\, (1.2) 

(2) 

We obviously have Mn ^ 2M„ by the triangle inequality, and it seems 
reasonable to guess that this bound is rather sharp; this would mean that 
the maximum distance (II. ip is attained by two vectors Xi and Xj that have 
almost maximum lengths, and almost opposite in directions. 

It is well-known, see e.g. Leadbetter, Lindgren and Rootzen that 
the asymptotic distribution of M„ (after suitable normalization) may be 
of three different types (assuming that the tail of the distribution of 
is so regular that there is an asymptotic distribution at all). The three 
types of limit distributions are known as the extreme value distributions 
and are often called Gumbel, Weibull and Frechet distributions; they have 
the distribution functions 

exp(— e~^), — oo < X < oo, (Gumbel) (1-3) 

exp(-|3;|"), -oo<a;^0, (Weibull) (1.4) 

exp(— x""), < X < oo, (Prechet) (1.5) 

where for the two last cases a is a positive parameter. 

We shall consider here the Gumbel case (which includes for example the 
normal distribution); it turns out that (under some assumptions at least) 

(2) 

Mn also has an asymptotic Gumbel distribution. The Weibull case (in- 
cluding for example the uniform distribution in a ball) was considered by 

[l^ : in this case the asymptotic distribution of M^^ also turns out to be 
Weibull, although with a different parameter. We have nothing to add to 
their results and give only a few comments in Subsection 15.11 The Frechet 
case (for example power law tails) seems to be more difficult. The limit 

(2) 

distribution for Mn is not known explicitly in this case, and we explain 



why in Subsection 15.21 

Our main result is the following theorem, whose proof is given in Sec- 
tion [3l We let for convenience X denote any random variable with the same 
distribution as the Xi. 

Theorem 1.1. Suppose that 2 and that Xi,X2, ■ ■ ■ are i.i. d. W^-valued 
random vectors with a spherically symmetric distribution such that for some 
sequences an and bn of positive numbers with bn = o{an), 

P(|X| >an + tbn) = i±^e-* (1.6) 
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as n ^ oo, uniformly for all t = tn with \t\ ^ log(a„/6„). Let Aln ^ := 
maxi^j<j<g„ \Xi — Xj\. Then, for every A G M, 

^ 2a„ - 6n log ^ - log log ^ - log c, - a) ) ^ e"^"' , 

(1.7) 

Q = (d-l)2'^-4r(d/2)/0F. (1.8) 



with 



In other words, 
(2) 

Mk'-2an d-l an an , d . , 

^ \ — Y~ 1 — 5 — — > (1-9) 

where V has the Gumhel distribution ¥{y ^ x) = e~'^ . 

Remark 1.2. In particular, since log(a„/6„) — )• 00, we assume that (|1.6p 
holds for every fixed t. This is, by a standard argument, see e.g. |lCli] , 
equivalent to 

P((M„ - a„)/6„ ^t)^ e~'~\ (1.10) 

i.e., 

Ay, (1.11) 

On 

where F has the Gumbel distribution. (This verifies our claim that we are 
dealing with the Gumbel case.) We also note that our assumption is a 
bit stronger than just assuming (jl.lip . since we require (jl.6p also for some 
t = t„ — >• 00; we expect that the difference is negligible in practice, and that 
Theorem 11.11 is likely to apply for any naturally occurring distribution that 
satisfies (jl.lip . 



Remark 1.3. As a corohary we see that typically 2M„ — M„ ' is about 
2 



^2^6n log(a„/6n); more precisely, (jl.Op and (jl.lip imply 



2M„ - M^n^ p d-l 



(1.12) 



6„log(a„/5„) 2 

It can be seen from the proof below that if we order Xi , . . . , Xn as , • • • , ^(n) 

(2) 

with Mn = 1^(1)1 ^ • • • ^ l-^(n)l5 then the probability that Mn is attained 
by a pair including tends to 0; the reason is that the other large vectors 
X(2), . . . probably are not almost opposite to ^(i)- However, if we consider 
points Xi such that is close to M„, with a suitable margin, then there 
will be many such points, and it is likely that some pair will be almost op- 
posite. There is a trade-off between what we lose in length and what we lose 
in angle, and the proof of the theorem is based on finding the right balance. 

We now give two special versions of the main result that are more conve- 
niently stated, and are most likely to be useful in applications. The proofs 
of these 2 theorems are given in Section [H 
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Theorem 1.4. Suppose that d ^ 2 and that Xi,X2, ■ ■ ■ are i.i. d. W^-valued 
random vectors with a spherically symmetric distribution such that 

P(|X| >x) = G{x) = e-9W+o(i) as x^oo, (1.13) 

for some twice differentiable function g{x) such that, as x ^ oo, 

xg'{x) oo, (1-14) 

4^1og2(x<7'(x)) ^0, (1.15) 

and that an and bn are such that, as n ^ oo, a„ — )• oo and 

5f(a„) = logn + o(l), (1.16) 

l + o(l/log(a„5'(a„))) 

bn = -■ (1.17 

g [an) 

Then ([TTl) and (^M) hold. 

Theorem 1.5. Suppose that d ^ 2 and that Xi,X2, ■ ■ ■ are i.i.d. W^-valued 
random vectors with a spherically symmetric distribution with a density 
function /(x) such that, as |x| — oo, 

/(x) ~ c|x|"e-^W^ (1.18) 

for some c, /3, 7 > and a G M. Then 

(/S^Slogi-i/Tn) • - (^27logn+ (2^^ - ^) log log n 

+ log log log n + log (^c:,/3-2("+'^)/7^-(^+3)/2^2^ ^ ^ V, 

where 

, (d - l)2'^-27r'^-V2 

and V has the Gumbel distribution. 

We give some specific examples in Section [2l and provide further com- 
ments as well as state some open problems in the concluding Section [5j 



1.1. Notation. All unspecified limits are as n — ?■ 00. In particular, Xn ~ 
yn means Xn/yn — ^ 1 as n — )• 00 (allowing also = yn = for some 

n). Convergence in probability or distribution is denoted by — ^ and 
respectively. We let x+ := max(x,0) for a; G M. 
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2. Examples 

Example 2.1. Suppose that Xi has a standard normal distribution in R'^, 
i.e., its mean is and its covariance matrix is the identity matrix so its d 
components are independent standard normals. The density function is 

fix) = (2^)"'^/2g~|x|V2^ (2.1) 

which satisfies pTTH]) with c = {2tt)-'^/^, a = 0, /3 = 1/2 and 7 = 2. Hence 
Theorem 1 1 . 5 1 yields . for d ^ 2. 

^2 log nM^ - ( 4 log nH ^ log log n+log log log n+log ^Y{d/2) ) 



This was shown by Matthews and Rukhin (ll| (with correction by ^) 



Example 2.2. Henze and Klein [5| considered, more generally, the case 
when Xi has a symmetric Kotz type distribution in M*^, d ^ 2, with density 



where 6 G M and k > 0. Theorem 11.51 applies with c = ^d/2r((j/2+b-^i) ' 
a = 2(5 — 1), (3 = K and 7 = 2, and yields 

Ak log n M^^ — ^4 log n + '^^ — - log log n + log log log n 

(ci-l)2(^-^)/^r(d/2) X^ 

^ ^ 0Fr(d/2 + 6-i)2 y 

as shown by 0]- 

Example 2.3. Suppose that Xi has a density /(x) in M'^, d ^ 2, with 

/(x) = /(|x|) ~ c|x|"e-''l''l as|x|^oo, (2.3) 
where c, /3 > and a G M. Theorem 11.51 applies with 7 = 1, which yields 

f3MP - (^2 log n + ^" + ~ ^ log log n + log log log n + log (^c'J-^^''+'^'^ c^) ^ 



3. Proof of Theorem 11.11 



Let A be a fixed real number and define two sequences r„ and s„ of positive 
numbers by 

r-n ■■= ^^-rr— log ^ - log log ^ - log q - A, (3.1) 
Sn:=5logr„. (3.2) 
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(The value of r„ is determined by the argument below, but s„ could be 
any sequence that tends to oo sufficiently slowly.) Note that r„ — )• oo and 
s-n ^ CK), and = o(rjj); furthermore, T^bn = o{an). We assume below 
tacitly that n is so large that r„ > s„ > 0, and (r„ + Sn)&n < On- 
We further for convenience let 

d-l an 

Tn-=^z—^og—; (3.3) 

Z On 

thus (|1.6p is assumed to hold uniformly for |t| ^ r„. Note that r„ + s„ ^ Tn, 
at least for n large; it suffices to consider only such n, and thus (jl.6p holds 
uniformly for |t| ^ r„ + s„. 

We shall in this section show the following, which immediately implies 
Theorem [LH since M^'' > 2o„ - r„5„ W„ = 0. 

Theorem 3.1. Let Xi, X2, ■ ■ ■ be as in Theorem li.il and let Wn be the 

number of pairs {i,j) with 1 ^ i < j ^ n such that \Xi — Xj\ > 2a„ — r„5„. 

Then W„ Po(e-^). 

We shall prove Theorem 13.11 by standard Poisson approximation tech- 
niques. However, it turns out that some care is needed, since it turns out 
that the mean does not converge in Theorem I3.lt at least in typical cases 
(where ()1.6p holds for \t\ ^ Clog(an/6„) for any constant C), KWn — t- 00; 
the problem is that while (|1.10p - (|l.lip show that the largest \Xi\ typically 
is about ttn, the unlikely event that max|Xj| is substantially larger gives 
a significant contribution to since an exceptionally large Xi is likely 

to be part of many pairs with \Xi — Xj\ > 2a„ — r„6„. (A formal proof 
can be made by the arguments below, but taking s„ to be a large constant 
times r„.) We thus do a truncation (this is where we use Sn) and define, for 
X, y G R'^, the indicator function 

/„(x,y) := l{|x - y| > 2an - Vnbn and |x|, |y| ^ On + Snbn] (3.4) 

and the corresponding sum 

VF;:= fniX^,X,). (3.5) 

(This is somewhat similar to the proofs of pjj] and ^] which also use Poisson 
approximation, but they use a decomposition with several terms.) Note that 
if /n(x, y) 7^ 0, then |x| + |y | ^ |x — y| > 2an — rnbn and thus 

< |x|, |y| ^ a„ + 

^nbn- 

(3.6) 

Remark 3.2. The fact that E Wn — >• 00 shows also that the asymptotic 

(2) 

distribution of MA is not the same as the asymptotic distribution of the 
maximum of (2) independent random variables with the same distribution 
as |Xi — X2I. This is unlike the Weibull case, see Subsection 15. 11 where 
points out that such an equivalence holds. 
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We show some lemmas; the first shows that we may consider VF^ instead 

oiWn- 

Lemma 3.3. P(W„ / W^) 0. 
Proof. We have 

F{Wn + W;,) ^ p(max|X,| > a„ + ^ nF{\X\ > a„ + s„6„) ^ 

by (jl.6p . since s„ ^ t„ and s„ — )• cxd. □ 

Lemma 3.4. Let Y and Z be two independent random unit vectors in M*^ 
such that Y is uniformly distributed on the unit sphere S'^~^ , and let be 
the angle between Y and Z . Then, as e \ 0, 

F(l + co.e<e)~ "W2'e'^-)/^. 



/^r((d + i)/2) 

Proof. By symmetry, Q has the same distribution for every Z, so we may 
assume that Z is deterministic, say Z = (1, 0, . . . , 0). In this case, if y = 
{r]i, . . . , rjd), then cos = (y, Z) = r/i; moreover, it is well-known (and easily 
seen) that rji has the density function 

g{x)=c'!,{l-X^)^''-^^l\ -1<X<1, 

where 

1/c'i = f {l-x^r-^y^ 6.x = f\l-yr-'y^-'/' = "^^^'7(^/2^'^'^ ' 
Hence, for < e < 1, 

— l+e 



[l + rii<e) = c^^ j '(l-x2)('^-3)/2dx 



„//o(c!-3)/2 

"'^ .e('^-i)/2(i + o(^)) 



{d-l)/2 

r(d/2)2(^-3)/2 . ^ 

r((d + i)/2)r(i/2)^ i^ + ^i^j;- 



^((d + i)/2)r(i/2) 

Lemma 3.5. // Y and Z are two independent random vectors in such 
that Y is uniformly distributed on the sphere \Y\ = a„ + thn and Z has 
any distribution on the sphere \Z\ = an + ubn, with \t\,\u\ ^ r„ + s„, then 
uniformly in all such t and u, 

n\Y -Z\> 2a„ - r„6„) ~ (^) ^'^'^^'(r„ + t + 
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with 



V^r((d + i)/2) v^r((d + i)/2)- 

Proof. By the cosine formula, letting Q be the angle between Y and Z, 

\Y - = + |zp - 2|y| |z|cose = + \z\f - 2\y\ |z|(i + cosG). 

Hence, by the assumption (r„ + Sn)bn = o(a„) and thus tbn,ubn = o(a„), 

\Y - Z\> 2a„ - r„6„ 

(2a„ + (t + ti)6„)^ - 2(a„ + tbn){an + + cos 6) > (2a„ - r„6„)" 

-I ^ (2a„ + (t + u)bnf - (2an - Vnbnf 

J- ~r COS , T \ / 7 \ 

2[an + tbn)[an + 
_ (4a„ + (t + n - rn)bn){t + u + rn)bn 
2(a„ + tbn){an + U^n) 

^ 26„ 

Or; 



-(r„ + t + u) (1 + 0(1)). 



The result follows by Lemma 13.41 (applied to i^/l^l and Z/\Z\] the angle 
remains the same), using again that r„6„ = o(a„); the probability is 
obviously when r„ + t + u^O. □ 

Remark 3.6. In this section we use fixed sequences a„,6„,r„,s„, but we 
note for future use that Lemma 13.51 more generally holds for any positive 
sequences with (1 + r„ + s„)6„ = o(a„). 

We define, recalling that X is any variable with X = Xi, 

Tn:={\X\-an)/bn; (3.7) 
thus \X\ = On + Tnbn and (II. 6p says that 

P(r„ > t) ~ -e-\ (3.8) 
n 

uniformly for all t = tn with |t| ^ r„, and in particular for all t with 

1^1 ^ ~t~ Sxi . 

Lemma 3.7. Suppose that the function h{t) is non-negative, continuous and 
increasing in an interval [tQ,ti], with [tQ,ti] Q [— r„,r„]. Then, uniformly 
for all such intervals [to,ti] and functions h, 

l + o(l) ^, , ^ //i(ti)e"*i\ 



E(h{Tn)l{to < T„ ^ ti}) = [ ' /i(t)e-* dt + o 



n 
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Proof. Let n = f^n ■= C{Tn) denote the distribution of r„. Then, using two 
integrations by parts (where we do the first in detail, using Fubini), 



E(/i(r„)i{io < t; ^ ti}) = [ ' h{t)dfiit) 

= h{to) ¥{to < r„ ^ ti) + / ' / dh{u) dfiit) 

= h{to) F{to <Tn^h)+ r dh{u) P(n <Tn^ h) 

J to 



'to 

rti 



\ n n / \ n n / 

^ + ""^^^ ^ /i(to)e-*« + r dh{u)e-A - ( h{to)e-'^ + d/i(u)e-*^ 



to / " V Jto 

h{ti)e- 



n \ Jto ) n 



n Jto n 
with ah 0(1) uniform in [toi^i] and h. □ 

Lemma 3.8. Let :x. be a vector in M*^ with |x| = a„ + n6„ where — r„ — Sn < 
u ^ Sn- Then, uniformly for all such x, 



E 



w;/iere c'j" := r{{d + l)/2)c'^' = 2'^'^T{d/2)/^. 

Proof. We use r„ defined by (|3.7p . and note that fn{X, x) = unless — 
Sn, see (j3.6p . Moreover, Lemma [3^5] shows that for t G ( r^ 



E(/„(X,x) I T^=t)^ 4'C-y'~'^^\r^ + t + u)t'^'\ 



uniformly in these u and t, and thus 

E/„(X,x) ~ c',"(^)^'"'^^'E(K + n + T„)J~^)/'l{-r„-Sn < T„ ^ s„}). 
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We apply Lemma 13.71 with h{t) = (r„ + n + i)^ ^^^^ and obtain 
E((r„ + U + r„)i^-')/'l{-r„ - s„ < r„ ^ 



TZ / (v> o \ Til 



n Jq \ n 

l + o(l) P .rf_iw2 ^ , ^ Ar„ + n + 5„ + l)('^-i)/2e- 



;>oo 



n Jq \ n J 

= ^^r(^) (e-+" + 0((r„ + s„ + n + l)('^-i)/2e--)) . 
and the result follows. □ 
Lemma 3.9. 

^f(Y 1 + 0(1) ,„fbn\id-m 2 + 0(1) _^ 

Efn{Xi,X2) = 5 — — r„e " = 5 — e . (3.9) 



Proof. We condition on Xi and apply Lemma 13.81 with X replaced by X2 
and u = Tn given by (|3.7p with X = Xi; this yields, recalling (j3.6p . 

E(/„(Xi,X2) I Xi) = E(/„(X2,Xi) I Xi) 

_ l+o(l)^„„/6^V'^-')/^ 



n va. 



(^e^n+T„ ^ + s„ + T„ + l)('^-i)/2e-^-)) l{-r„ - s„ < r„ ^ 

(3.10) 

Hence, 

E/„(Xi,X2) =E(E(/„(Xi,X2) I r„)) 



E(^(^e^"+^" + 0((r„ + 5„ + r„ + l)('^-^)/2e-^")j l{-r„ - s„ < r„ ^ . 

By Lemma [STl with /i(t) = e* we obtain, since s„ = o(r„) and r„ — )• 00, 
/ T r .\ 1 + 0(1) . N / 1\ 1 + 0(1) 



(3.11) 



and by Lemma O with /i(t) = (r„ + s„ + t + l)('^-i)/2, 

E((r„ + s„ + r„ + l)('^-i)/2i{_^^ _ 5„ < 7; ^ = O f j , 

and the result follows, using our choice of Vn in (j3.ip and q = (d — l)c^"/4. 



□ 
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Lemma 3.10. 

E{fn{Xi,X2)fn{Xl,Xs)) =o{n~''). 

Proof. By (ICTjl . 

E{UX,,X2) I Xi) = O (^(^y'''^^\--+^^l{-rn -Sn<Tn^ Sn}) , 

where we used (r„ + s„ + + = 0(e''"+''"+'^"). Hence, using 

Lemma [321 with h{t) = e^*, 

E(/„(Xi,X2)/„(Xi,X3)) =e(e(/„(Xi,X2) I Xif) 

and the result follows by our choice (|3.2p of s^. □ 

Proof of Theorem \3.1\ (and thus of Theorem Consider the sum de- 
fined in (j3.5p . and note that Lemma 13.91 shows EW^ — )• e~^. Moreover, the 
Poisson convergence 

<APo(e-^) (3.12) 

follows from Lemmas 13.91 and 13.101 using a theorem by Silverman and Brown 
[l^ : see also Barbour and Eagleson Q|, Barbour, Hoist and Janson [3, Theo- 
rem 2.N and Corollary 2.N.1] and Jammalamadaka and Janson Theorem 
3.1 and Remark 3.4]. 

The conclusion — ^ Po(e~'^) now follows by Lemma 13.31 □ 



4. Proofs of Theorems 11.41 and 11.51 

Proof of Theorem \1.4\ First, by (jl.lip and a„— t- oo, g'{an) > for large 
n at least, so l/g'{an) > 0. Furthermore, ang'{an) — )• oo by (jl.lip . so 
hn ~ l/g'{an) and 6„/a„ by (|1.17p . 

We will prove that (|1.6p holds, uniformly for all t with |t| ^ yllog(a„/6„), 
for any fixed A. The result then follows by Theorem II. 1[ In order to prove 
(|1.6p . we may suppose that hn = l/g'{an)', the general case (II.ITI) follows 
easily. We may also suppose that n is large. 

Thus, let ^ > be a constant and let, for x so large that xg'{x) > 1, 

, , logfxg'(x)) 
S{x) := A " 4.1 

g'(x) 



and 



Ix ■= [x — 5{x),x + Six)]. 



(4.2) 
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Since 6{x)/x — )• as x — )• oo by ()1.14p . we may assume that < 6{x) < x/2; 
hence Ix C (x/2,2x). We claim that, for large x, 



-g'{x)<g'{y)<2g\x), 



y G Ix- 



(4.3) 



To show this, assume that (j4.3|) fails for some x and let y by the point in Ix 
nearest to x where (14. 3|) fails. (If there are two possible choices for y, take 
any of the points.) Then 



1 



1 



1 



2g'{x) 



(4.4) 



g'{y) g'{x) 

By (|4.4|) and the mean value theorem, there exists z G [y, x] (if y < x) ox 
z G [x, y\ (if y > x) such that 

d 



1 



2g'{x] 



1 



dz g'{z) 



5'(^)^ 



5'(^)^ 



(4.5) 



On the other hand, 29'(^) ^ d'i^) ^ "^g'i^) by the choice of y; furthermore, 
z £ Ix so x/2 < z < 2x. Hence (jl.l5p implies, for large x, using ()1.14p . 



\9"i^)\ 



1 



g'izf ^ log'izg'iz)) ^ log^{xg'{x)) 
However, ()4.5p and ()4.6p combine to yield 



(4.6) 



1 



25(x) 



2A 



2g'{x) ~" \og^{xg'{x)) g' {x)\og{xg' {x))' 

which contradicts p.l4p for large x. This contradiction shows that 
holds for large x. 

Next, (jlJ]), and (fTTTSl) imply that, for large x, 

sup |/(y)|5(x)2 = ^^^^^^^f^^log2(x5'(x)) 

^5^2 SUp^jMlog2(y5'(y))^0 

as X — 7- oo. Consequently, a Taylor expansion yields, uniformly for \u\ ^ 
8{x), 

g{x + u) = g{x) + ug'{x) + o{l) (4.7) 

as x — )• oo. Taking x = a„ and u = tbn = t/g'{an), with \t\ ^ Alog(a„/6„) = 
j4 log(a„(7'(a„)), we have \u\ ^ 5(a„) by (|4.ip . and thus (|4.7p applies and 
shows, by (I1.16P and our choice 6„ = l/g('(a„), 

5(a„ + t6n) = g{an) + tbng'{an) + o(l) = logn + t + o(l), (4.8) 

uniformly for such t. By (jl.lSp . this yields 

P(|Xi| > a„ + t6„) = exp(- logn - t + o(l)) 

uniformly for \t\ ^ Alog(a„/6„), which is (jl.6p . The result follows by The- 
orem 11.11 □ 
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Before proving Theorem 11.51 we give an elementary lemma. 

Lemma 4.1. If I3,j > and h[x) is a positive differentiable function such 
that {logh{x)y = o{x'^~^) as x ^ oo, then 



h{y)e-^y" dy ~ (fS-fy^x^-^ hix)e-^''" 



as X ^ oo. 



Proof. Suppose first that 7 = 1. Then we assume (log/i(x))' = o(l). Let 
£(3;) := supj^^^. I (log /i(y))'|; thus e{x) — )• as x — )• 00. Furthermore, for 
t > 0, 

|log h{x + t) — log h{x)\ ^ e{x)t 

and thus 

g-(/3+.(.))t < + t)e-^(-+'^ ^ g-(/3-.(x))t 
h{x)e-P'' 

Integrating we obtain, for x so large that £(2;) < /3, 



/■oo 
JO 



P + e{x) Jo 13 - e{x) ' 

and (j4.1l) follows when 7 = 1. 

For a general 7 we change variable by y = z^/"^: 



Jxi 

The function H{z) = ^^^h{z^/"')z^/'^"^ satisfies 

(log H{z)y = (iog/iy(zi/^) •7-izi/T-i + (7-1 - i)z-' = 0(1) 

and thus the case 7 = 1 apphes and yields 

/ h{y)e-^y^ = / H{z)e-^' dz ~ ji'^ H{x^)e-f^'''' as x ^ oo, 

which is dH]). □ 

Proof of Theorem \1.5[ Let cjfi-i := 27r^/^/r(d/2), the surface area of the 
unit sphere S'^~^ in M''. By (fO^ and Lemma 1131 with h{x) = x"+'^"\ 

/>oo 

P(|Xi| > x) ~ / cr^e-^^^Wd-ir'^"^ dr ~ ct^rf_i(/37)-^x"+'^-^e-^^''. 

J X 

Hence (|1.13|) holds with 

g{x) = (3x^ - (a + d - 7) log X + log(/37/ cujd-i) ■ (4.9) 

We have 

g'{x) = p-fx"/-^ - {a + d--f)x-^, (4.10) 

/(x) = /37(7 - l)xT-2 + (a + d - 7)x-^ (4.11) 

and p.l4p - p.l5p are easily verified. 



an := P'^'^ logi/^ n + bn ('^±^_SL log log n - [a + d)T^ log /3 
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In order to have (I1.16|) we need, since g{x) ~ I3x'^ as x — )• oo by (14. 9|) . 
fln ~ log^^''' n; furthermore, (I1.17P then yields 

K ~ ~ ~ /3-i/T7"i logi/^-^ n. 

9'{an) l3-fal 

We thus choose, for simplicity, 

bn := /3-i/^7-i logi/^-^ n. (4.12) 
If u„ = O(loglogn), then, by a Taylor expansion and (|4.9p ~ (|4.1ip . 

g{(3-'^nog'/^n + Unbn) 

= g{f3-^/nog^/^n)+Unbng'{r^/nog^/''n)+o{l) 

= logn- {a + d- 7)7"^(loglogn - log/3) + log{/3-f/cu}d-i) 

+ Un + o(l). 

Hence we define 

'a + d — 7 

7 

-log7 + log(cwd_i)) (4.13) 

and find that (|1.16p holds. Furthermore, by another Taylor expansion, 

g'{an) = /3^/^7log^-i/^ n- (l + 0{log-^ n + log log n • log^^/^n)) , 

and (jl.lTp follows easily. Hence Theorem II. 41 applies and (jl.9p holds. More- 
over, by (li33D and (1^21) . 

an/K ~ 7logn, 
log(a„/6„) = log log n + log7 + o(l), 
loglog(a„/6„) = logloglogn + o(l), 

and the result follows from ()1.9p by collecting terms, with = Crfa;^_j^, 
which yields ([Lin]). □ 

5. Further comments 

5.1. Weibull type extremes. The Weibull type extreme value distribu- 
tion occurs for random variables that are bounded above; in our context this 
means that \X\ is bounded, so X takes values in a bounded set. By scaling 
we may assume that the upper endpoint of the support of \X\ is 1, so X 
belongs to the unit ball, but not always to any smaller ball. The typical 
case is 

P(|X| > x) ~ c(l -x)", X/-1, (5.1) 
for some a > in which case 

P(n^/"(1 - M„) > x) ^ exp(-cx"), (5.2) 
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which means that ci/"ni/"(M„ - 1) converges to the (negative) Weibuh 
distribution in (11. 4p . 

Mayer and Molchanov [13] show that if (jS.ip holds, then Mn^ also has an 
asymptotic Weibull distribution, with a different parameter. More precisely, 
they show the following. 

Theorem 5.1 (Mayer and Molchanov [l3])' Suppose that d ^ 2 and that 
Xi,X2, . . . are i.i.d. W^-valued random vectors with a spherically symmetric 
distribution such that (jS.ip hold for some a ^ and c > 0. Then 

p^^4/(d-i+4.)(2 _ > ^ exp(-Cd,„c2x('^-i+^°)/2], (5.3) 

with 

_ r(a + i)^r((d + i)/2) 2'^-3r(a + i)2r(d/2) 

2r((d+l + 4a)/2) V^r((d + l + 4Q)/2)- ^"^ 

Hence, n^/^*^"^"*"^") (M^^'' — 2) has, apart from a constant factor, the (neg- 
ative) Weibull distribution (|1.4p with parameter (d — 1 + 4a)/2. 

Note that Theorem 15.11 includes the case a = 0, i.e., when 1P(|X| = 1) = 
c > 0; in particular the case |X| = 1 with X uniformly distributed on the 
unit sphere. (The latter case was earlier shown by Appel and Russo [2*].) In 
the case a = 0, (|5.2p does not make sense; the asymptotic distribution of 
M„ is degenerate, since P(M„ = 1) — )• 1. 

Theorem 15.11 can easily be proved by the method in Section [Sj we sketch 
this here, omitting some details. ([12i| and [2] also use Poisson approxima- 
tion, but the details are different.) We let an '■= 1, bn '■= c~^^°'n~^/°' (with 
bn := 1 when a = 0), r„ := a;6-in-^/('^-i+^") and s„ = 0. (We take s„ = 
since no truncation is needed in this case, which simplifies the argument. 
Cf. Remark 13.21 ) We define again by (13.41) . but note that ^ a„, = 1 
always holds; thus defined by (13. Sp equals Wn and no Lemma 13.31 is 
needed. 

In the case a = 0, it is easily seen that it suffices to consider the case 
of X uniformly distributed on the unit sphere; it is then easy to show that 

Wn PO using Lemma [33] and completing the proof by 

the same theorem by Silverman and Brown [1^ as before. 

If a > we proceed as in Section [3l using Remark 13.61 and define r„ by 
(j3.7p . noting that T„ < and that 

P(T„ >t)= F{\Tn\ < -t) ~ t < 0. (5.5) 

n 

This leads to an analogue of Lemma 13.71 proved in the same way; it suffices 
to consider the interval [— r„,0] and we then obtain 

E(h{Tn)l{-rn < r„ ^ 0}) = r /i(t)a|i|°-Mt. (5.6) 
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Using ()5.6I) several times, we then prove analogues of Lemmas 13. 8^ 13.91 and 

Km and Wn Po(crf,c.c2ic^'^~^+^"^/^) follows as in the proof of Theo- 
rem [TTTl 

Remark 5.2. Since the normalizing factors in (15. 2p and (15. 3p have different 

(2) 

powers of n, 2 — MA is asymptotically much larger than 1 — M„, and thus 

(2) (2) 

2Mn - M^ ' has the same asymptotic distribution as 2 — M„ , see (|5.3p . Cf . 
Remark 11.31 for the Gumbel case. 

We have here for simplicity considered only the standard case when ()5.ip 
holds, and leave extensions to more general distributions with M„ asymp- 
totically Weibull to the reader. 

5.2. Prechet type extremes. The Frechet type extreme value distribution 
occurs for \X\ if (and only if, see [lol . Theorem 1.6.2 and Corollary 1.6.3]) 
there exists a sequence 7^ — )• oo such that 

P(|X| > X7„) ~ (5.7) 
n 

for every (fixed) x > 0; then 

7-iM„ A V, (5.8) 

where V has the Frechet distribution (|1.5p . The typical case is a power-law 
tail 

F{\X\ > x) ~ cx-° as X ^ 00; (5.9) 
in this case, (|5.7|) - (|5.8|) hold with 7„ = (cn)-*^/". 

Theorem 5.3. Suppose that 2 and that Xi,X2, ■ ■ ■ are i.i.d. W^-valued 

random vectors with a spherically symmetric distribution such that ()5.7p 
hold for some 7„ — )• 00. Then 

7-^m(2) a Z„ (5.10) 

for some random variable Z^, which can be described as the maximum dis- 
tance maxjj l^j — ^jl between the points in a Poisson point process S = 
on \ {0} with intensity acj^^jxl^""'^. 

Sketch of proof. It is easy to see that the scaled set of points {'^~^Xi : 1 ^ 
i ^ n}, regarded as a point process on R'^ \ 0, converges in distribution to 
the Poisson process H. It then follows that the maximum interpoint distance 
converges. We omit the details. (See e.g. Kallenberg [1] or [§] for details on 
point processes, or Janson 0, § 4] for a brief summary.) □ 

Note that the point process H has infinite intensity, and thus a.s. an 
infinite number of points, clustering at 0, but a.s. only a finite number of 
points 1^1 > £ for any e > 0. (This is the reason for regarding the point 
processes on M'' \ only, since we want the point processes to be locally 
finite.) 
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We leave it as an open problem to find an explicit description of the limit 
distribution, i.e. the distribution of Za- We do not believe that it is Frechet, 
so Mn and Mn will (presumably) not have the same type of asymptotic 
distribution in the Frechet case, unlike the Gumbel and Weibull cases treated 
above. 

One reason for the more complicated limit behavior in the Frechet case 
is that the Poisson approximation argument in Section [3] fails. If we define 
as there, with a suitable threshold and a suitable truncation (avoiding 
small this time), we can achieve E/„(Xi,X2) ~ Cn^'^ as in Lemma [3.91 
for a constant C > 0, but then E /„(Xi, X2)/n(^i, ^3) will be of order 
and there is no analogue of Lemma I3.10| this ought to mean that Wn does 
not have an asymptotic Poisson distribution. In other words, the problem 
is that there is too much dependence between pairs with a large distance. 

Furthermore, it can be seen from Theorem 15.31 that there is a positive 

(2) 

limiting probability that the maximum distance Mn is attained between 
the two vectors and X(2) with largest length, but it can also be attained 
(with probability bounded away from 0) by any other pair ^(fe) and with 

given 1 ^ k < I. This is related to the preceding comment, and may thus 

(2) 

also be a reason for the more complicated behavior of Mn in the Frechet 
case. (It shows also that there is an asymptotic dependence between MA 
and Mn which does not exist in the Gumbel and Weibull cases.) Moreover, 
it follows also that, again unlike the Gumbel and Weibull cases, the angle 
between the maximizing vectors Xi and Xj is not necessarily close to vr (it 
can be any angle > 7r/3), so it is not enough to use asymptotic estimates as 
Lemma 13. 4[ 

5.3. The case d = 1. The theorem above supposes d > 1, for example 
because we need r„ — )■ 00. In the case d = 1, there is a similar result, which 
is much simpler, but somewhat different; for comparison we give this result 
too. 

Theorem 5.4. Suppose that Xi,X2,... are i.i.d. symmetric real-valued 
random variables such that for some sequences an and 5„ of positive num- 
bers with bn = o(a„), <\1.6h holds as n ^ 00, for any fixed real t. Let 
(2) 

MA := maxi^i<j<;„ \Xi - Xj\. Then, 



where V\ , V2 are two independent random variables with the Gumbel distri- 
bution F{V± ^t) = e-'^"'. 

Proof. When d = 1, 




'n 



(5.11) 



where M, 



m(2) =M+-M-, 
Xi and M~ := miuj^^Xj. 



(5.12) 
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There are about n/2 positive and n/2 negative Xi. More precisely, de- 
noting these numbers by and N- = n — , where we assign a random 
sign also to any value that is 0, we have A^-f, ~ Bi(n, 1/2). Conditioned 
on and A^_, and assuming that both are non-zero, and M~ are 

independent, with M+ = M^r^ and M~ = — Mjv_. Moreover, if we assume 
(|1.6p for every fixed t, then it is easy to see that for any random N with 
N/n 1/2 as n — 7- oo, we have 

--hlog2 — >V, (5.13) 



h 



n 



cf. (II. lip . Consequently, 

^^^f^ + log2Ay„ (5.14) 

where V± are two random variables with the same Gumbel distribution; 
moreover, it is easy to see that this holds jointly with and y_ indepen- 
dent. The result follows from (iET2|) and (fSTi]) . □ 

Comparing Theorem 15.41 to Theorem 11.11 we see that first of all the limit 
distribution is different. Furthermore, the term log{an/bn) in (jl.9p disap- 
pears, which is expected since it is multiplied by (d — 1)/2, but also the term 
loglog(a„/6„) disappears, and the constant term is different, with — logQ 
replaced by 2 log 2. (q in (jl.Sp would be for d = 1, which does not make 
sense in (jl.9p .) 

Similarly, in the special case in Theorem 11.51 we obtain for d = 1 from 
(j4.13p and (j4.12p (which hold also for d = 1 by the proof above) the following, 
where the limit distribution again is different; furthermore, the log log log n 
term disappears and the constant term is slightly different. 

Theorem 5.5. Suppose that Xi,X2, ■ ■ ■ are i.i.d. symmetric real-valued ran- 
dom variables with a density function f{x) such that, as \x\ — t- oo, 

/(x) ~ clxl^e-'^l^l^ (5.15) 

for some c, /?, 7 > and a G M. Then 



(/3^/^7log^-^/^n) • mP - (^27logn+ (2'^ 2) log log n 

+ log ( /3-2("+i)/77-2c2) ^ A y+ + y_ , 



where V± are independent and have the Gumbel distribution. □ 

Typical examples are given by f{x) = (27r)^-'^/^e~^^/^ and f{x) = ^e"'^'; 
we leave the details to the reader. 

The argument above applies also to the Weibull and Frechet cases when 
d = 1; we omit the details. (Furthermore, Theorem 15.31 holds also for d = 1.) 
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5.4. Non-symmetric distributions. We have assumed that the distribu- 
tion of X is spherically symmetric. What happens if we relax that condition? 
Consider for example the case of a normal distribution with a non-isotropic 
covariance matrix, for example with a simple largest eigenvalue so that there 
is a unique direction where the variance is largest. Will the asymptotic distri- 

(2) 

bution of Mn then be governed mainly by the component in that direction 
only, so that there is a limit law similar to the case d = 1, or will the result 
still be similar to the theorems above for the spherically symmetric case, or 
is he result somewhere in between? We leave this as an open problem. 

For the case of points distributed inside a bounded set, Appel, Najim 
and Russo [if and Mayer and Molchanov [12] have results also in the non- 
symmetric case. 

5.5. Other norms. We have considered here only the Euclidean distance. 
It seems to be an open problem to find similar results for other distances, 
for example the ^^-norm or the ^°°-norm in W^. 

Acknowledgement. We wish to thank Professor Norbert Henze for bring- 
ing this problem to the attention of one of us. 
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